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Abstract — v 

We here propose a segmentation algorithm of texture image for com- 
puter vision system on space robot. An improved Adaptive Reso- 
nance Theory (ART2) for analog input patterns is adapted to clas- 
sify the image based on a set of texture image features extracted by 
a fast Spatial Gray Level Dependence Method (SGLDM). The non- _ 
linear thresholding functions in input layer of the neural network _ 
have been constructed by two parts: firstly to reduce the effection of 
image noises on the features, a set of sigmoid functions is chosen de- ^ 
pending on the types of the feature; secondly, to enhence the contrast , 
of the features, we adopt fuzzy mapping functions The cluster lunn- ^ 
her in output layer can be increased by an autogrowing mechanism 
constantly when a new pattern happens. Experimental results and _ 
orginal or segmented pictures are shown, including the comparison 
between this approach and K-means algorithm. The system written 
by C language is performed on a SUN-4/330 spare-station with an 
image board IT- 150 and a CCD camera. 

1. Introduction 

Segment ation and classification of textured images have been consid- 
erable attention to contain significant discriminatory information for 
image segmentation in a variety of application, such as terrain classi- 
fication, military surveillance and recogirition, remote sensing images 
and biomedical image analysis^ . Although texture is a fundamental 
characteristic of images , the complexity involved in its quantifica- 
tion has presented its effective incorporation into the segmentation 
process. 

in this paper, the neural network of an improved Adaptive Res- 
onance Theory ( ART2) is presented to segment an image consisting 
of several region.s with different textures. Artificial neural netwoiks 
offer several advantages over conventional classification techniques, 
due to their high computation rate, great degree of fault tolerance 
and unsupervised ability. The number of researches have engaged on 
the researchment by neural networks^'^'^^U 

In this paper, section 2 defines the texture feature types which 
are derived from co-occurrence matrixes and selection of maximum 
and minimum measure window for feature extraction of the texture 
image. Section 3 describes an approach of improved ART2 neural 
network with alterable competitive layer (Fn layer). The nonlinear 
thresholding fuction in Fj layer is displaced by a fuzzy mapping 


function. Section 4 shows the results of experiments and illustration. 

2. Feature extraction of texture image 

Whether the segmentation of texture image is good or not depends 
on the extraction of texture features. There are number of the ap- 
proaches to have been developed for feature extraction of the text ure 
image: Fourier power spectrum method (FPSM)^, spatial texture 
energy®, Markov random field modef , Gibbs random field model®'^, 
zero-sum filter masks gray level run length method (GLRLM) 

spatial gray level dependece method (SGLDM )^, gray level dif- 
ference method (GLDM) and other methods 6,i2,i3,i5 of 

these methods belong to statistical method, others to structural one. 
Among them, spatial gray level dependence method, which is intro- 
duced by Haralick et al. in their paper^, is one of the most .successful 
.statistical representation for the texture. The feature measurement 
from cooccurrence matrices in the SGLDM is rather similar to the 
knowledge captured by the human eyes, and provides a convenient 
w’^ay to represent the properties of object textures. Weszka et al. 
experimentally compared feature on terrain images and fouiul that 
SGLDM is more powerful than the GLDM, GRLM, and FPSM 
Ohanian et ai also pointted that the features by SGLDM were bet- 
ter than Markov random field, multi-chaiiiiel filtering features, and 
fractal based features^®. It is known, however, that the SGLDM 
requires much |>rocessing time and great number of memory. Only 
for mean probability distribution, 2^^* times of multijdication in the 
SGLDM are done when a measured image is a size 64 x 64 with gray 
level 128, and the tendency will be raised at exponent rate with the 
enlargement of the image size, particularly, the iucrea.se of gray-level 
number. 

In tliis paper, we use a set of simplifed equations leased on a 
fact that rows or columns around the current pixel are included or 
excluded almostly at the same time while the measured wdndo\v is 
displaced in the liorizontal or vertical direction of tlie image, so we 
could make the equation be simplifed viewing from the pixels of 
row- s{ columns) both excluded and included from a w'indow rather 
than a pixel method Some calculations are done one time in a 
row or column instead of one in a pixel, so algorithm in the paper 
consumes much less time than Harilick's method. 

We defined a co-occurrence matrix of relative frequencies with 
whicli two pixels separated by distance d at a specified angle occur 
on the image, one with gray level i and the other with gray level j. 
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A distance of one pixel, i,e. the measuring window slides over the 
image in one step length^ in both liorizontal and vertical direction 
and angle quantixed to 45® intervals, or 0®, 45®, 90®, and 135® will 
be used. We give a set of simplified equations: 

(1) Mean 

= '"*-1 + -A, r) (1) 

(2) Variajice 

= <^Li^ 

]V (2) 

(3) Correlation 


Ct = lCt.,al.,+ 



2 T — ^ . , 

- + ’»t-i - ’nl]/<^k 

(3) 

(4) Energy 
Let 



r- 
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J- = l/(-V,_, - 2) 


i+_ 

+ 2 

= l/(lVi-i + 2) 



+ (J-)=(a-4A/,-.,) 
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(6) Contrast 

n = n., + 1 - jirf - o) 

(7) Homogeneity 

Hk = Ht.i + 

I E « 1 + ('t r (10) 

where the ^ is the ^ stands for the length of the row 

or colunm, i.e. the wide of measuring square window, the M(ij) 
is the element of a co-occurrence matrix, superscipts “d-” and 
express for a pixel (x,y) included or excluded from the window. The 
equations for both energy and entropy features are used to the case 
considering a pixel included or excluded from the window because of 
the nonlinear decomposition for square and logarithm functions 

3. Improved ART2 

Connectionist cTasslffcation used here is called Adaptive ^sonance 
Theory(ART) In general, ART is divided into two types de- 

pending on input patterns. ARTl is applied to solve binary input 
problem, ART2 i.s available to both binary and analog inputs. In the 
paper, the ART2 is used to classify the texture image because the 
20 features (five for each angle) belong to gray-scale patterns. 

The classifier in the ART2 consists mainly of two subsystems: the 
attentional subsystem and the orienting subsystem. The former is 
composed of the Short-Term Memory (STM) and Long-Term Mem- 
ory (LTM) elements. 


E+ = (i+)»F+_. + (J+)"(a + 

where 


(5) Entropy 





3.1 Short-Term Memory (STM) 

(5) The the input representation field, and F^, the category rep- 
resentation field(competitive mechanism), are the two STM main 
components. 

composed of three layers with STM activation equations as 
^ ^ (see Fig. 1 ) 

p. = «, (11) 


=L'£'P,1,-i-log(i-)-,4- (7) 


EP+ = L*EP*_, - L* log(£+) - A+ (8) 
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(13) 

(14) 


w^ = /| -H au, 

Wt 


e+ «.• 


(15) 

(16) 


where a,b, and e are constants, y^ is the STM activation of the Jth 
A neuron, |( || is the L 2 norm, /() is a nonlinear threshold function: 


( 2J+(A/t_ ,(f+^ , A^)+ l)log( i+(A*_ ,(i+„ ;■+,)+ 1) 

4+,,! 2J+Mt_#+^J+>g(rH/,.,(,+^J+.)' ' ijij 

I •^■^(A^i-i(iLJitr)+2)log(i+(A/,.i(f+,J+,)+2) 

I log(i+ A/* . , ( i+ ^ : J+ . ’) i =j 

^ Ordinaniy. rather than using a single displacement berauee small values for 
step length d yield the best results for the extraction of image features proved 
bv VVeszka at, al.^ 


f{r] 


0 for 0 < X < 0 
fore<x<oi 

^“2(|^)2 fora<x<i3 

1 for X > P 


(17) 


where the feature noises are suppressed by seting f(z) to zero when 
^ Tlie fuzzy mapping function is used to enhence the 

contrast among the features, and makes the iimui oatterns classified 
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Fig. 1. Typical ART2 architecture 25,26,27 

more easily. The normalization mechanism keeps the pattern from 
saturation in spite of the constant presence of the pattern during 
the learning process. The F\ layer provides internal feedback and 
a correlation between normalized bottom-up and top-down patterns 
to stabilize all activities in the STM before transmitting the output 
of the F\ layer to the Fj layer. 

3.2 The search phase 

In the F 2 , a competitive mechanism is used to choose a winning 
neuron. Firstly, the input pattern of the Fi is applied to the bottom- 
up adaptive filter by the bottonvup adaptive weight Z,j. 

T, = J2p‘^’J forj = (18) 

where K is the total number of existing categories in the F 2 , then 
the vector T is put in the order from minimum value to maximum 
one. We here suppose the Jth neuron in the F 2 is selected if this 
neuron becomes maximally active one among the neurons not to be 
reset in the trial, i.e. 

Tj = max{Tj) j^l.2 Kj (19) 

where Kj is the total number of the categories not to be used, then 
only winning neuron in the F 2 has nonzero outputs. 

{ d if the Jth F 2 neuron w the winner based on 

max(YlpiZij) and it has not been reset in the trial 
0 otherwise 

( 20 ) 

The top-down pattern g{Tj) is then feedback to the Fi by top-down 
adaptive weight Zj, and compared to the original bottom-up pattern 
to see if a correct match has been made by an activated orienting 
subsystem. 

3.3 Orienting subsystem 

The orienting subsystem helps to directly search for the categories 
in the Fo . When the subsvstem is activated, tlie bottom-uo nattern 


vector p and the top-down pattern vector u are utilized to calculate 
the degree of match (vector r) 

_ / 21 ) 
if the choise in the F 2 is correct, i.e. 

(II ’’ ll> P (22) 

where p stands for the vigilance factor or match sensitivity parame- 
ter. At this time, adaptive resonance is considered to have occured 
and entered to the categories in the Long-Term Memory (LTM). If 
the choice is incorrect, another neuron with maximum output value 
among the existing neurons not to be selected will be selected as a 
possible winner candidate. The new candidate may cause yet an- 
other mismatch, lienee another reset happens and the selection of 
yet another neuron, eventually, either the bottom-up pattern will be 
placed in an existing category or learned as the first example of a 
new category in the F 2 layer. It is possible for an autogrowing mech- 
anism to be activated to create a new category if no category in the 
F 2 could be used to save the new one. 

3.4 The Long-Term Memory (LTM) 

The LTM is made up of two components, the bottom-up adap- 
tive weight Zij and the top-down adaptive weight Zji, When the 
match operation in the orienting subsystem occures successfully, the 
bottom-up and top-down weights should be adjusted. The weights 
can been obtained easily by 

= ^ (23) 

^- = 7^ (24) 

The procedure in the improved ART2 can be summarized as: 
Step 1. Initialize bottom-up and top-down adaptive weights Zij and 
Zji in the LTM. 

Step 2. Apply a new input pattern. 

Step 3, Stabilize the output vectors u(or p) of the Fi layer by 
repeated operating Eqs. 11 ~ 16, including noise reduction and con- 
trast enhencement by a nonlinear thresholding function and fuzzy 
mapping function. 

Step 4- Compute the output vector p by Eq. 18. 

Step 5. Select a winner neuron by Eqs. 18 and 19 if neurons not to 
be selected exist, else go to step 7. 

Step 6. Apply Eq. 21 to determine whether the selected top-down 
winner pattern matches the bottom-up input u within a certain ac- 
cepiance level of vigilance, if Eq. 2i is not true, the selected winner 
neuron in the F 2 is disabled and return to step 5 in order to choose 
another winner neuron ; else go to step 8; 

Step 7. Autogrowing mechanism is activated to create a new cate- 
gory. 

Step 8. Only adjust the bottom-up and top-down adaptive weights 
with respect to the matched winner neuron by Eqs. 23 and 24. 

Step 9. Before taking the next new input pattern, neuron which has 
been disable int step 6 will be enabled. The process return to step 2 
if a new input pattern at least exists, else exit the system. 

Viewing from the improved ART2 algorithm, if the network for 
an input patten has learned previously to recognize the pattern, then 
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(C) 


(d) 


Fig. 2. (a) original, nat ural texture image, (b) segmentation by K-means algorilhin, 
(c) energy segmentation by the improved ART2 only with noise reduction, (d) en- 
ergy segmentation by the improved ART2 with both noise reduction and contrast 
en hen cement. 


a resonant state will be achieved quickly when that pattern is pre- 
sented, ada)>t ive process will reinforce the memory of the stored pat- 
tern by formulas. If the pattern is not inmiediately recognized, the 
network will rapidly search througli its stored patterns looking for a 
match. If no match is found, it will enter a resonant state whereupon 
the ]iattern will be stored as a new category for the first tinre 


if unused neurons in the com]3etitive layer exist. Otherwise, a new 
neuron should be created automatically by the autogrowing mech- 
anism tlie Fi layer to store the new pattern. Thus, the network Is 
able to respoiul fastly to previously learned data, yet learn novel 
data when tliose are presented. 
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4. Experiment results 

The performance of the segment algor itlini by improved ART2 is 
examined by a series of experiments on image containing different 
texture. The size of each image is 100 x 100 with 256 gray levels. 
The size of maximum and minimum measuring window is defined as 
11x11 and 33 x 33, respectively. 

For the texture features from tlie image by fast SGLDM algo- 
rithm, the K-means algorithm is used (shown in Fig. 2 (b)). 
How'ever, the K-means algorithm has following disavantages: 

• Supervised learning mode; the number of clusters Jiiust be set 
in advance, the different number may classify different results; 

• Slow real-time ability: time of classification will raise at expo- 
nent rate with the cluster number increased; 

• Uiistability: the results of classification depends on the preci- 
sion of feature extraction, when the extraction of the texture 
features has slightly change, the classifing result might be dif- 
ference. 

Compared to the K-means algorithm, the ART2 has many advan- 
tages, such as unsupervised training, high computation rates, and 
great degree of fault tolerance (stalility/pleisticlty). 

In our test, the features, i.e. energy, entropy, correlation, ho- 
mogeneity and inertia (or called as contrast), are used in texture 
analysis. The features have been proved to be a high degree of accu- 
racy for the extraction of texture image features^. The parameters 
a, b, c, d, e, 9 and p is selected in advance. a=b=10., c=0.25, d=0.8, 
e=I0“®. the selection of 9 depends on different texture features and 
quantized angles of the features. For instance, tlie noise of each an- 
gle for the energy feature in the test is similar, so the value of 9 is 
selected as 0.23 in every angle of the feature. On the other hand, the 
noise of eacli angle for the contrast feature is slightly different, the 9 
is set to 0.1, 0.12, 0.2, and 0.1 for the feature along to angle 0®, 45®, 
90®, and 135®, respectively. The Fig 2. (a) is the original texture 
image. The Fig. 2. (c) is the segmenting result of the improved 
ART2 only with noise reduction. It is seen from the Fig. 2. (c) to 
greatly improve the segmentation of the texture image. The Fig. 2. 
(d) shows that the segmentation operation is further good after not 
only the noise reduction but also the feature enhencement are done. 

5. Conclusion 

The SGLDM provides the most powerful statistical representation 
for segmentation and identification of texture images. Its problem, 
consuming time has been improved greatly by a fast algorithm. 

An improved Adapihe Resonance Theory (ART2) for analog in- 
put patterns is adapted to classify the image based oii a set of texture 
image features extracted by a fast SGLDM. The non-linear thresh- 
olding functions in the ART2 Fj layer have been composed of two 
parts: to reduce the effection of image noises on the features, a set 
of sigmoid functions is chosen depending on t he types of the feature; 
to eiiheiice the contrast of the features, we adopt fuzzy mulU-region 
mapping funciions The cluster number in output layer can be in- 
creased by an autogrowing mechanism constantly when a new pat- 
tern happens. The system written by C language is performed on 
a SUN-4/330 spare-station with an image board IT- 150 and a CCD 
camera. 
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